Extended Faceted Taxonomies for Web Catalogs
نویسندگان
چکیده
Indexing and retrieval in Web catalogs can benefit from using faceted taxonomies. A faceted taxonomy consists of a set of facets, where each facet consists of a predefined set of terms structured by a subsumption relation. We propose two extensions of faceted taxonomies, which allow inferring conjunctions of terms that are valid in the underlying domain. We give a model-theoretic interpretation to these extended faceted taxonomies and we provide mechanisms for inferring the valid conjunctions of terms. This inference service can be exploited for preventing errors during the indexing process and for dynamically generating navigation trees that are suitable for browsing through the Web. The proposed scheme has several advantages by comparison to the hierarchical classification schemes that are currently employed by Web catalogs, namely: (a) conceptual clarity: it is easier to understand; (b) compactness: it takes less space; and (c) scalability: the update operations can be formulated easier and be performed more efficiently.
منابع مشابه
Rapid Induction of Multiple Taxonomies for Enhanced Faceted Text Browsing
In this paper we present and compare two methodologies for rapidly inducing multiple subject-specific taxonomies from crawled data. The first method involves a sentence-level words co-occurrence frequency method for building the taxonomy, while the second involves the bootstrapping of a Word2Vec based algorithm with a directed crawler. We exploit the multilingual open-content directory of the W...
متن کاملThe usability issues of faceted navigation in digital libraries
The last decade transformed faceted navigation from a “nice-to-have” into a “must-have functionality” for all online web services that contain a search functionality. All commercial websites have undergone this change from online clothing stores to travel agencies, all driven by the wish of facilitating instant access to their products. From commerce it shifted to other domains, such as librari...
متن کاملFASTAXON: A System for FAST (and Faceted) TAXONomy Design
Building very big taxonomies is a laborious task vulnerable to errors and management/scalability deficiencies. FASTAXON is a system for building very big taxonomies in a quick, flexible and scalable manner that is based on the faceted classification paradigm [4] and the Compound Term Composition Algebra [5]. Below we sketch the architecture and the functioning of this system and we report our e...
متن کاملMulti-faceted Learning for Web Taxonomies
A standard problem for internet commerce is the task of building a product taxonomy from web pages, without access to corporate databases. However, a nasty aspect of the real world is that most web-pages have multiple facets. A web page might contain information about both cameras and computers, as well as having both specification and sale data. We are interested in methods for supervised and ...
متن کاملTactics for Information Search in a Public and an Academic Library Catalog with Faceted Interfaces
This study examined a large number of searches conducted when the users are interacting with two Endeca-based faceted library catalogs (University of North Carolina at Chapel Hill [UNC] Library catalog and Phoenix Public Library [PPL] catalog). The goal is to investigate people’s search tactics with the faceted catalogs in an academic library and a public library environment. Two large data set...
متن کامل